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ADAPTIVE ITERATIVE DETECTION 



CROSS-REFERENCE TO RELATED APPLICATIONS 



This application claims the benefit of U.S. 
Provisional Application Serial No. 60/176,951, filed on 
January 18, 2000, which is incorporated herein by reference. 



This invention relates to adaptive iterative detection 
schemes. 

To improve performance, coverage, and efficiency, 
modern wireless communication systems utilize digital 
signaling techniques. However, as these communication 
systems become more widespread, the problem of achieving 
clear reception in a noisy transmission channel becomes 
important. Cellular phone systems are one example of 
wireless communication systems and they also serve to 
illustrate the problems associated with such systems. For 
example, a typical disturbance phenomenon in a cellular 
phone system is known as inter-symbol interference. This 
interference may occur from fading, reflections, delays, or 
other disturbances from the cellular phone system, which 
typically involve mobility between the transmitter and 
receiver. 



BACKGROUND 
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In digital signaling, it should be possible to extract 
the "correct" symbol despite the presence of interference 
from other symbols using error control coding. However, 
this task is often complicated by poor knowledge of the 
characteristics of the transmission channel itself. For 
example, it is difficult to know just how cellular phone 
signals are being reflected, delayed, faded, and otherwise 
interfered with, as the cellular phone is moved around. 
Therefore, poor knowledge of the channel leads to 
uncertainty about integrity and accuracy of the received 
data. Accordingly, there are many known methods and systems 
that attempt to estimate the parameters of a transmission 
channel in order to better reconstruct a transmitted signal. 
However, in many practical situations, perfect channel 
estimation is not available at the receiver. Consequently 
an adaptive iterative receiver may deal with the unknown, 
and possibly time varying parameters. 

Recognizing the above-described difficulties, a new 
class of codes, referred to as turbo codes, has been 
introduced. This class of turbo codes has made signaling at 
power efficiencies close to the theoretical limits possible. 
The features of a turbo code include parallel code 
concatenation, non-uniform interleaving, and iterative 
decoding. Because turbo codes may substantially improve 
energy efficiency, they are attractive for use over channels 



t 
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that are power and/or interference limited. Furthermore, 
turbo codes also offer good performance over fading 



channels. A turbo decoder may be used to decode the turbo 
code. The turbo decoder may include two soft -input/ soft- 
5 output (SISO) decoding modules that work together in an 
iterative fashion. 

The SISO decoding module is the basic building block 
for established iterative detection (ID) techniques for a 
system having a network of finite state machines, or more 

10 generally, subsystems. However, performing iterative 

detection for systems having parametric uncertainty may be 
complex and difficult. Previously proposed SISO techniques 
are either based on an oversimplified channel model, or have 
complexity that grows exponentially with the observation 

15 length (or the smoothing lag) . 

SUMMARY 

In recognition of the above, the present disclosure 
describes a system for estimating inputs and outputs of a 
digital transmission system. The system includes a receiver 
20 front -end configured to receive an observed digital signal 
in the digital transmission system. The system further 
includes forward and backward recursion elements, at least 
one forward channel estimator, at least one backward channel 
estimator, and a combiner. 
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The forward recursion element is adapted to receive the 
observed digital signal, and generate a first sequence of 
soft information by performing a forward recursion. The 
forward channel estimator is adapted to receive the observed 
digital signal and the first sequence of soft information, 
and to estimate channel parameters using the first sequence 
of soft information. The backward recursion element is 
adapted to receive the observed digital .signal, and generate 
a second sequence of soft information by performing a 
backward recursion. The backward channel estimator is 
adapted to receive the observed digital signal and the 
second sequence of soft information, and to estimate channel 
parameters using the second sequence of soft information. 

The combiner is configured to compute a transitional 
information that ties forward and backward estimates of the 
first and second sequences of soft information together. 
Soft information on the inputs and outputs of the 
transmission system is then generated by combining updated 
first and second sequences of soft information and the 
transitional information . 

The present disclosure also describes an iterative 
receiver system. The receiver system includes a channel 
processor, a soft-in/soft-out decoder, and an 
interleaver/de-interleaver pair. 
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The channel processor is configured to receive a 
plurality of coded symbols, and to produce and update soft 
information on the plurality of coded symbols. The channel 
processor is activated by updated soft information on 
interleaved code symbols. The soft-in/soft-out decoder is 
configured to receive the soft -information on the plurality 
of coded symbols, and to compute soft information on the 
coded symbols. The interleaver/de-interleaver pair operates 
to pass the soft information to/from the channel processor 
from/to the soft-in/soft-out decoder. After several 
iterations, final bit decisions are made on uncoded bits by 
the sof t-in/sof t-out decoder by thresholding the 
corresponding soft information produced by the sof t-in/sof t- 
out decoder. 

The present disclosure further describes a method for 
estimating inputs and outputs of a digital transmission 
system. The method includes receiving an observed digital 
signal in the digital transmission system, generating a 
first sequence of soft information by performing a forward 
recursion, and estimating channel parameters using the first 
sequence of soft information and the observed digital 
signal. The method also includes generating a second 
sequence of soft information by performing a backward 
recursion, and estimating channel parameters using the 
second sequence of soft information and the observed digital 
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signal. A transitional information that ties forward and 
backward estimates of the first and second sequences of soft 
information together is computed. Soft information on the 
inputs and outputs of the transmission system is generated 
by combining updated first and second sequences of soft 
information and the transitional information. 



These and other features and advantages of the 
invention will become more apparent upon reading the 
following detailed description and upon reference to the 
accompanying drawings . 

FIG. 1A is a block diagram of an input sequence block 
encoded by a Convolutional Code (CC) , and an interleaved 
version of the input sequence encoded by a second CC. 

FIG. IB is an iterative detector network for perfect 
Channel State Information (CSI) . 

FIG. 1C is an A-SODEM-based iterative receiver. 

FIG. ID is an A-SISO-based iterative receiver. 

FIG. 2A is an iterative detector configured as a serial 
concatenation of convolutional codes (CCs) . 

FIG. 2B is an iterative detector network for perfect 

CSI. 

FIG. 2C is an A-SODEM-based iterative receiver. 
FIG. 2D is an A-SISO-based iterative receiver. 



DESCRIPTION OF DRAWINGS 
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FIG. 3A shows a transmitter portion of a conventional 
digital communication system. 

FIG. 3B shows a receiver portion of a conventional 
digital communication system. 



processing. 

FIG. 3D is a receiver based on an adaptive iterative 
detection principle. 

FIG. 4 illustrates a likelihood evaluation using 



FIG. 5 illustrates soft metric evaluation in the case 
of sequence-first combining.. 

FIG. 6 illustrates a Trellis-based adaptive soft- 
in/soft-out (A-SISO) technique with multiple or single 
estimator. 

FIG. 7 shows a serial concatenation of FSMs and the 
associated iterative detection network for the case of 
perfect CSI . 

FIG. 8 is a plot of BER vs. E b /N Q for system SI and 
various configurations for inner A-SISO. 

FIG. 9 is a plot of comparison between forward/backward 
and forward only inner A-SISO for system SI, for various 
values of the decision lag. 

FIG. 10 is a plot of BER vs. E b /N G for system S2 and 
various configurations for inner A-SISO. 



FIG. 3C is a block diagram of an optimal receiver 



forward/backward tree structures . 
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FIG. 11 is a plot of BER vs. E b /N c for systems S2 and 



S3 employing hard-decision and soft -decision decoding BER 
vs. E b /N D for the receiver employing adaptive and non- 
adaptive (using interpolated channel estimates) inner SISOs 



FIG. 12 is a plot of BER vs. E b /N Q for the receiver 
employing adaptive and non-adaptive (using interpolated 
channel estimates) inner SISOs for different payload sizes. 

FIG. 13 is a plot of BER vs. loop bandwidth for the 
10 SCCC static phase. 

FIG. 14 is a plot of. BER vs. E b /N 6 SCCC with phase 
dynamics and various inner A- SI SO and A-SODEM configurations 
(the optimal performance for SING receivers was achieved for 
d=0) . 

is FIG. 15 shows metric and parameter update for A-SISO in 

the forward direction for the case of multiple parameter 
estimation. 

FIG. 16 is a plot of BER vs. E b /N Q PCCC with phase 
dynamics and various inner A-SISO and A-SODEM configurations 
20 (the optimal performance for SING receivers was achieved for 



FIG. 17 illustrates a method for estimating inputs and 
outputs of a digital transmission system. 



5 



for different payload sizes. 



d=0) . 
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DETAILED DESCRIPTION 



In recognition of the above -described difficulties with 
prior iterative detection systems, the present disclosure, 
in one aspect, describes an adaptive iterative detection 
system based on adaptive soft-input/soft-output (A-SISO) 
techniques. The ' expressions for the soft metrics in the 
presence of parametric uncertainty modeled as a Gauss -Markov 
process are derived in a way that enables the decoupling of 
complexity and observation length. Starting from these 
expressions, a family of sub-optimal (i.e. practical) 

techniques is presented based on forward/backward adaptive 
processing with linear complexity in N. A-SISO techniques, 

as well as existing adaptive hard-decision techniques are 
interpreted as special cases within this framework. Using a 
representative application, several design option's are 
compared and the impact of parametric uncertainty on 
previously established results for ID with perfect channel 
state information is assessed. The representative 
application may include joint iterative equalization- 
decoding for trellis-based codes over frequency- selective 
channels, and channel estimation in turbo-coded systems. 
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INTRODUCTION 

The present disclosure presents iterative detection 
(ID) schemes for systems including multiple finite state 
machines (FSMs) . An iterative detection algorithm may be 

5 loosely defined as the set of rules to exchange, combine, 

and marginalize some sort of soft information related to the 
FSM input /output symbols, with the purpose of providing 
reliable decisions about the input sequence. Applications 
that utilize this scheme include turbo decoding of parallel 

10 and serial concatenated convolutional codes (PCCCs and 
SCCCs) decoding of trellis coded modulation (TCM) in 
interleaved frequency-selective fading channels as well as 
various multidimensional detection problems. The core 
building block in these iterative schemes include the soft- 

15 input soft-output (SISO) module. The building block may 
also include a novel technique similar to the Viterbi 
Algorithm (VA) that accepts a priori information on the 

input and output symbols of an FSM and outputs the 
corresponding a posteriori information, with complexity 

20 growing linearly with the record length. 

The codes are constructed as concatenations of simple 
constituent codes, and have been shown to achieve near- 
capacity performance. In a standard PCCC, shown in FIG. 1A, 
a block of the input sequence is encoded by a Convolutional 

25 Code (CC) , while an interleaved version of the input 
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sequence is encoded by a second CC. The two coded symbols 
are then mapped to the constellation points and transmitted 
over the channel. In some cases, the mapping may be 
performed after puncturing. 
5 When perfect Channel State Information (CSI) is 

available, a decoder that approximates Maximum Likelihood 
Sequence Detection (MLSD) performance with reasonable 
complexity can be constructed, by utilizing iterative 
detection (ID) . The effectiveness of ID may be attributed 

10 to the exchange of soft information related to the 

input/output symbols of each constituent CC. A systematic 
approach to designing ID receivers may be presented for 
systems having an interconnection of multiple subsystems. 
In this approach, soft inverse blocks - each one 

is corresponding to a subsystem in the original network - may 

be connected in an intuitive way to form the ID network. In 
general, the soft inverse requires marginalization of joint 

probabilities over all combinations of possible input/output 
sequences. This is the operation described, for example, by 
20 the Soft Mapper (SOMAP) soft inverse block. When the system 

is an FSM, however, the soft inverse may be computed 
efficiently using the so-called forward -backward Soft-Input 
Soft -Output (SISO) technique. Such a receiver is shown in 
FIG. IB for the case of PCCC with perfect CSI. This general 
25 view of iterative detection is broadly applicable and 

-11- 
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consistent with the view of message passing or belief 
propagation on graphical models. For details on this 
equivalence and the graphical approaches see Keith Chugg, 
Achilleas Anastasopoulos , Xiaopeng Chen, Iterative 

Detection: Adaptivity, Complexity Reduction, and 

Applications, by Kluwer Academic Publishers, 2001. 

Iterative detection (ID) may be utilized in various 
diverse applications. For example, ID may be utilized to 
decode Serially Concatenated Convolutional Codes (SCCCs) . 
As shown in FIG. 2A, in a SCCC the sequence of source bits 
is convolutionally encoded using an outer CC. These symbols 
are fed to an inner CC through a pseudo- random interleaver. 
The output symbols are then mapped onto the constellation 
points and transmitted to the channel . The corresponding 
iterative decoder may be constructed using soft inverse 
blocks, as in the case of PCCC, as shown in FIG. 2B. 

In many practical situations where perfect channel 
state information (CSI) is not available at the receiver 
(e.g., PCCCs and SCCCs with carrier phase tracking or TCM in 
fast frequency- selective fading channels) , an adaptive 
iterative detection (AID) scheme may be required to deal 
with the unknown, and possibly time-varying parameters. 
The present disclosure discloses a subclass of adaptive 
iterative receivers in which the parameter estimates are not 
exchanged as part of the iterative procedure. Rather, the 
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parameter estimates are generated and are confined inside 
the adaptive SISO (A-SISO) modules, which are the natural 
extension of the SISO modules for the case of parametric 
uncertainty. Nevertheless, the exchange of soft information 
on the symbols provides an implicit mechanism for the re- 
estimation of the unknown parameters as well. 

In a case of the unknown parameter being modeled as a 
Markov chain with finite number of states, the optimal A- 
SISO includes a modified SISO that runs on the augmented 
FSM. Another case may include the parameter being 
continuous in nature (e.g., phase offset or channel taps). 
Attempts to solve this more general problem were based on 
the Baum-Welch method (or equivalently the expectation 
maximization (EM) technique) . Since convergence to a 
locally optimal solution is possible, the optimality of the 
EM technique cannot always be guaranteed. In some cases, a 
Gauss -Markov (GM) model may be assumed for the unknown 
parameter and the optimal scheme may be derived. Starting 
from a different viewpoint, structurally similar techniques 
may also be derived for GM and deterministic parameter 
models, respectively. A sub-optimal A-SISO with a single- 
parameter estimator may further be developed. However, the 
inherent limitation of all the above approaches is that they 
all operate in a fixed lag (FL) mode; thus, two major 
conflicting goals in designing a practical technique are 



• PATENT 
Attorney Docket No.^B 1 666/060001/USC-2990 

coupled through a single parameter, the smoothing depth. 
Indeed, in an FL technique, a large decision delay 
(smoothing depth) is required to deliver reliable soft 
information. However, the same parameter determines the 
amount of pruning of the sequence tree and needs to be kept 
as small as possible, especially since it results in 
exponential complexity growth. 

In the present disclosure, meaningful soft metrics for 
the GM parameter preprocessing model are defined and 
expressions are derived in a novel way. This motivates a 
family of sub-optimal techniques leading directly to fixed 
interval (FI) schemes that have linear complexity with the 
record length N t as is the case for SISOs when no parametric 

uncertainty is present. The unique characteristic of the 
family of sub-optimal techniques may include the decoupling 
of complexity and smoothing depth. Furthermore, all 
existing A-SISO techniques for continuous valued parameter 
models may be viewed as forward only special cases within 

this framework. An application examined in the present 
disclosure describes TCM in interleaved frequency-selective 
fading channels. In this application, the effectiveness of 
the various A-SISO options has been assessed via extensive 
simulations. The impact of parametric uncertainty on 
previously established conclusions for iterative detection 
in systems including concatenated FSMs, is described. 
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ADVANTAGES OF ITERATIVE DETECTION 

FIGS. 3A through 3D illustrate one embodiment of a 
digital communication system. FIG. 3A shows a transmitter 
5 portion of the system, which may include channel coding, 
interleaving, and modulation of transmission signal. The 
receiver block diagram illustrated in FIG. 3B mirrors the 
processing performed in many practical receiver 
implementations. This segregated design paradigm allows 

10 each component of the receiver to be designed and 

"optimized" without much regard to the inner workings of the 
other blocks of the receiver. As long as each block does 
the job it is intended for, the overall receiver should 
perform the desired task: extracting the input bits. 

is However, despite the comprehensiveness of FIG. 3B, it 

may not be optimal from the standpoint of performance. More 
specifically, the probability of error for the bit estimates 
or bit-sequence estimate is not minimized by this structure. 
This segregated processing is adapted for tractability - 

20 both conceptual tractability and tractability of hardware 
implementation. The optimal receiver for virtually any 
system is conceptually simple, yet typically prohibitively 
complex to implement. For example, consider the 
transmission of 1000 bits through a system of the form in 

25 FIG. 3A. These bits may undergo forward error correction 

-15- 
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coding (FEC) , interleaving, training insertion (pilot, 
synchronization fields, training sequences, etc.), before 
modulation and transmission. The channel may corrupt the 
modulated signal through random distortions (possibly time- 
varying and non- linear) , like- signal interference (co- 
channel, multiple access, cross-talk, etc.), and additive 
noise. Thus, regardless of the complexity of the 
transmitter and/or channel, the optimal receiver would 
compute 2 1000 likelihoods and select the data sequence that 
most closely matches the assumed model. This is shown in 
FIG. 3C. Ignoring the obvious complexity problems, this 
requires a good model of the transmitter formatting and the 
channel effects. For example, the likelihood computation 
mentioned above may include averaging over the statistics of 
a fading channel model or the possible data values of like- 
signal interf erers . 

Data detection and parameter estimation are done using 
the entire global system structure. Unlike the direct 
approach in FIG. 3C, the iterative receiver in FIG. 3D 
exploits this structure indirectly. The key concept in this 
approach is the exchange and updating of "soft inf ormation"- 
on digital quantities in the system (e.g., the coded 
modulation symbols) . This concept is shown in FIG. 3D. The 
iterative detection receiver is similar to the conventional 
segregated design in that, for each subsystem block in the 
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model, there is corresponding processing block. In fact, 
each of these corresponding processing blocks in the 
receiver of FIG. 3C exploits only local system structure - 
e.g., the FEC decoder does not use explicit knowledge of the 
channel structure. As a consequence, the complexity of the 
receiver in FIG. 3D is comparable to the traditional 
segregated design in FIG. 3B (i.e., the increase in 
complexity is usually linear as opposed to the exponential 
increase in complexity associated with the optimal 
processing in FIG. 3C) . 

The distinction between segregated design of FIG. 3B 
and that of FIG. 3D, however, is that the processing for 
each sub-block in FIG. 3D is biased by some beliefs on its 
inputs and outputs. These beliefs (also referred to as 
reliabilities, soft decision, or soft information) are 
provided to each local processing unit by other processing 
units connected to it. These beliefs represent marginal 
soft information in that they are beliefs on the individual 
symbols as opposed to the entire sequence. The task of the 
processing unit is to update the beliefs on the input and 
output variables of the corresponding system sub-block in 
FIG. 3A. Each sub-block processing unit will be activated 
several times, each time biased by a different (updated) set 
of beliefs. 
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For example, suppose that a system using convolution 
coding and interleaving experience severe like -signal 
interference and distortion over the channel. In this case, 
the channel mitigation block in FIG. 3B will output hard 
decisions on the coded/ interleaved bit sequence a k . .Suppose 

that, given the severity of the channel, the error 
probability associated with these coded-bit decisions will 
be approximately 0.4. Deinterleaving these decisions and 
performing hard-in (Hamming distance) decoding of the 
convolution code will provide a very high bit error rate 
(BER) - i.e. nearly 0.5. 

For the receiver in FIG. 3D, however, the channel 
mitigation block produces soft-decision information on the 
coded/interleaved bit sequence a k . For example, this may be 

thought of as two numbers P[a^=l] and P [a k =0] that 

represent a measure of current probability or belief that 
the Jc-th coded bit a k takes on the value 1 or 0, 

respectively. Soft decisions contain more information than 
the corresponding hard decisions. In this example, it is 
possible that even though the hard decisions on a k 

associated with the receiver of FIG. 3B are hopelessly 
inaccurate, the soft decision information contains enough 
information to jump-start a decoding procedure. 
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A receiver of the form in FIG. 3D would pass the soft 
information through a deinterleaver to a soft -in decoder for 
the convolutional code. This decoder is a modified version 
that produces beliefs on the coded bits as well as uncoded 
bits. Thus, after activation of this decoder, a decision on 
the uncoded bits may be made. Alternatively, the updated 
beliefs on the coded bits may be interleaved and used in the 
role of a-priori probabilities to bias another activation of 
the channel mitigation processing unit in FIG. 3D. In fact, 
this processing may be repeated with the channel mitigation 
and FEC decoder exchanging and updating beliefs on the coded 
bits through the interleaver/deinterleaver pair. After 
several iterations, final decisions may be made on the 
uncoded bits by thresholding the corresponding beliefs 
generated by the code processing unit. This is referred to 
as iterative detection. 

A. Complexity Reduction 

Since the processing of FIG. 3D approximates the 
performance of the optimal processing with much lower 
complexity, iterative detection may be viewed as a 
complexity reduction tool. This complexity reduction is 
based on the partitioning or decomposition of the system 
into subsystems. Specifically, the complexity of the 
receiver in FIG. 3D is roughly the sum of the complexities 
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of the individual processing units, multiplied by the number 
of times each is activated (i.e. number of iterations). 
Since each of these processors uses only local structure, 
the associated complexity is roughly the same as the optimal 
decoder for that subsystem in isolation. For example, in 
the soft -decoder described in the above example has roughly 
the same complexity as the well-known Viterbi Algorithm 
(VA) . A key conceptual point is that the partitioning 
described is entirely arbitrary. The system may be 
decomposed into any number of equivalent block diagrams, 
each of which faithfully represents the system structure. 
This modeling choice, however, impacts the complexity of the 
associated processors in FIGS. 3B and 3D. For example, the 
receiver in FIG. 3C corresponds to a model without 
decomposition and is prohibitively complex. Furthermore, 
the notion that exchanging and updating soft information may 
be a replacement for hypothesis searching effort may be 
exploited aggressively to provide complexity reduction for 
systems that may not even have been modeled as a 
concatenation of subsystems. 

B. Adaptivity 

Acquisition and tracking of channel parameters (e.g., 
carrier phase and frequency, symbol synchronization, channel 
impulse response, etc.) may also be included into the 
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iterative processing of FIG. 3D. In the above example, the 



be unknown and/or t ime -varying . Therefore, the channel 
mitigation processor should estimate and track these 
parameters (possibly aided by some training signals) . In an 
iterative detector, it is possible to re-estimate these 
parameters at each activation of the channel mitigation- 
processing unit. The estimates are different for each of 
these activations because the reliability information 
biasing this estimation changes. Specifically, it is 
refined by the code processor using the structure of the 
code. Thus, through the passing of marginal soft 
information and iteration, the channel estimator indirectly 
uses the global system structure. A decision directed 
estimation processor may operate with the coding gain even 
when isolating interleavers are present. For an isolated 
system, this may be possible using joint parameter 
estimation and data detection (e.g. Per-Survivor 
Processing) . However, these adaptive iterative detection 
approaches are applicable to systems comprising concatenated 
subsystem (e.g. concatenated codes with interleaving). 



interference and distortion associated with the channel may 



SYSTEM AND CHANNEL MODEL 

Consider a typical time-division multiple-access (TDMA) 
cellular transmission system having a memory- less source 
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that feeds a convolutional code. The trellis-coded symbols 
are interleaved, mapped into a constellation, and pulse- 
shaped before transmission. The low-pass equivalent 
transmitted signal may be of the form 

00 

5 s(t) = JT^dtPit-kT ) (1) 

where d k is the coded symbol (normalized to unit 

energy), E s is the symbol energy, p(t) is the shaping pulse 

(normalized to unit energy) , and T is the symbol duration. 

This signal is distorted by a time-varying frequency- 
10 selective fading channel with impulse response of the form 

c(t,z) = Y j c„(t)8(T-nT r ) (2) 

where T r = T / N r with N r integer, and the dynamics of 

the vector random process c(t) = [c 0 (t ),..., c Lc (t)J T are assumed to 

be slow compared to the symbol duration T. The distorted 

is signal is observed in additive white Gaussian noise (AWGN) 
with power spectral density level N Q . 

co L c 

z(t) = y fi;Y d d kY d C n (t)p(t-kT -nT r ) + n(t) . (3) 

k = -co n=0 

Several options are available at the receiver front -end 
(FE) for preprocessing the received signal: low-pass 
20 filtering or match filtering with P*(-t), followed by 
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fractionally-spaced sampling every T 8 (where T S =T/N S , with 

N s integer) , followed by noise whitening (if necessary) . 

Regardless of the specific FE structure, the FE output may 
be modeled as an equivalent symbol -spaced vector inter- 
symbol interference (ISI) channel as follows: 

L 

*k = V^"S lk (n) + 5* ( 4a > 
= 4E^[d k ,...,d k _ L ]og k +n k 

= il °£* + 5* =il °( v £k) + Hk ( 4b ) 
where , n_ k and S_ k ( n ) are all N s -dimensional 

vectors, and U_k is complex, circular AWGN with 
independently, identically distributed components and 

E {\\Ekf J = N o • The N S (L C +1)- dimensional vector 



c k =[c 0 (kN s T s ),..., c Q ((kN s +N S -1)T S ) c L J(kN s +N S -1)T S )] T contains 

all the information relevant to the channel process c(t), 
while all the details of the pulse shaping and the FE are 
included in the matrix V. Finally, the N s (L + l)- 

15 dimensional vector g_ k = [gj k (0 ),..., §J k ( L )] T is the equivalent 

channel (which includes the effect of pulse shaping, 
channel, and FE) at time x, and the shorthand diamond ( o ) 

notation is used in equation (4b) to denote the mixed inner 
product implied by equation (4a) . 
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Equations (4a) and (4b) may be used under either a 
stochastic or an a-stochastic (i.e., deterministic) 
assumption for c(t) . An often used model for c kt and thus 

g_ k = Vc k is that of a Gaussian autoregressive -moving average 
5 (ARMA) process, generated by the plant equations 

where is a white noise sequence and the 
dimensionality of the state <j> is in general higher than 

c k . It has been shown that such model may adequately 

10 approximate realistic fading channels with non-rational 
spectrum. 

Although the specific FE processing (i.e., the matrix) 
may be important for making quantitative claims and for 
claiming optimality, qualitative conclusions about different 

is postprocessors remain the same for different FEs (e.g., per- 
survivor Processing (PSP) outperforms the conventional, 
adaptive maximum- likelihood sequence detector (CA-MLSD) ) . 
The present disclosure focuses on the introduction of novel 
post -processing approaches that are valid for any FE 

20 processing. However, in order to improve the readability of 
the development and to reduce the simulation effort, we 
focus on the following special case. In particular, 
regarding the channel model in equation (5) , a first -order 
GM model is adopted for c k , and thus g (i.e., $ k =c k is 
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assumed) . Furthermore, the concepts are illustrated using a 
simplified symbol-spaced scalar ISI model (i.e., N 8 = 1). 

Once the concepts introduced are understood, the techniques 
may be modified to account for the more general case of N 8 > 

1 and arbitrary plant model. 

In the following, a model for a generic FSM is 
presented. The output y k of a generic FSM can be defined as 

a function of its input x k and state s k - together 

constituting the transition t k = {s k , x k ) - through the 

equations 

y k = out(x k ,s k ) s k+l =ns(x kf s k ) (6) 

where each integer quantity u k (i.e., x k/ y k , s k or t k ) 
is assumed to take values in the set A u = {0, 1, . .., N u -l) . 
The output y k of the FSM is either used as an input to 

another FSM, or observed indirectly, through a function, 
which also involves the unknown parameter g k . Under the 

simplifying assumption mentioned earlier, the (L+l) - 

dimensional vector process {g k } evolves in time according to 

the equations 

S k = Gg + w k (forward) ^ 
8 k =G b gk-i + v k (backward) 

where w k , v k are zero-mean Gaussian vectors with 
covariance K w (m) = Q5 K (m) and K v (m) = Q b h K (m) , respectively. 
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Equation (4) may now be written as 



*k = f(ykf g k +"k = ?*g* +*k= m k +"k 



(8) 



where q k = f (y k ) is a complex vector depending on the 



modulation format (e.g., in the TCM system, /(•) maps the 



output of the inner FSM, which is the entire transition y k = 
t kf to the L + 2 constellation points, as shown in equation 



EXACT EVALUATION OF THE SOFT METRICS 

The objective of a SISO technique is to provide soft 
information about the input and output symbols of the FSM 
based on the observation record. This reliability 
information may either be in the form of an a posteriori 

probability or any other related quantity. It would be 
advantageous at this point to generalize the notion of the 
state s k and transition to longer sequence portions 

(e.g., a super-state and super- transition may be defined as 
s l=(tk-d.-"tk-i.Sk) and tl ^ih-d'-h) for arbitrary d) . This 
may also be viewed as the "trellis state" ("trellis 
transition") that is used in the receiver and need not be 
the same as that which defines the system model (e.g., 
encoder) . This foreshadows the result that the optimal 
techniques do not "fold" onto a trellis as in the case of 



(4) ) . 
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known channel and that the size of the trellis eventually 
used is a design parameter. For a generic quantity 
u k (i.e., x k ,y k ,s k t k ,s k ,t s k ,etc.) , we define the a posteriori 

probability (APP) and minimum sequence metric (MSM) soft 
outputs as follows: 

APP p (u k ) = P(u k \z n 0 ) 

= fc£/Y*o.*o"; (9a) 

x» Q :u k 
x n Q :u k 

MSM p (u k ) = -log[ max P(z n Q \x^)J 

x n 0 :u k 

= c'-log[ max P(z;,xS)J (9b) 
= c'-log[ max E e {P(zZ,xZ \@)J 

where x l :u k denotes all input sequences consistent 
with u kf and c, and c' are normalizing constants. These 

soft outputs are the direct generalizations of well-known 
soft outputs for perfect CSI to the case of an unknown 
parameter 0 . When the SISO module is part of an iterative 
receiver, the soft output is usually normalized to the a 

priori information resulting in the so-called extrinsic 
information (e.g., APP p (u k ) / P(u k ) , or MSM p (u k ) - (- log P(u k )) 
is used in place of APP p (*) or MSM p (*) , respectively) . We 
observe that in both cases, the soft outputs may be derived 
from the quantity E q {P(zq ,x£ \ ®)] by either averaging or 
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maximizing - for APP p (m) or MSM p (m) , respectively - over 



Equation (9) suggests a way of manipulating P(z%,xg\®) 
to obtain the proposed soft metrics. Maintaining the 
conditioning over the entire input sequence, expectation may 
be performed on the unknown parameter. Combining of the 

resulting metrics over the nuisance parameters Xq :u k is 

performed as a final step, leading to the final two soft 

metrics for u^. Since operators :u k an< ^ commute, an 

additional choice is available for the evaluation of the 
metric in equation (9a) . Here, the sequence combining is 
done initially, followed by the parameter elimination. 
Different soft metrics can also be defined by interchanging 

the max(Xo :u k ) operator with the E0 operator in equation 

(9b) . This option will not be pursued in this work, mainly 
because it does not appear to lead to rigorously expressed 
optimal structures . 

A. Parameter-First Combining 

Optimal techniques for the evaluation of the soft 
outputs defined in equations (9a) and (9b) , and more 

precisely the quantity P(z£,jtJ) are derived. It is noted 

once more that these techniques are optimal for a given FE 



the nuisance parameters *o ; u 
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processing at the receiver. The obvious approach is a 
straightforward evaluation of this likelihood for each of 
the (N x ) n+1 input sequences. The procedure is concluded with 
the appropriate combining of these quantities (summation or 
maximization for &PP p (u k ) or MSM p (uj , respectively). This 

type of processing is based on the fact that the likelihood 

P(zq,Xq) may be computed recursively 



function of a complex circular Gaussian random variable with 
mean m, and variance a 2 / 2 for the real and imaginary part, 



corresponding covariance matrix generated by a sequence - 
conditioned Kalman filter (KF) . This technique, although 
efficient, results in sub-optimal techniques where 
complexity and smoothing depth are exponentially coupled, as 
mentioned above. 

An alternative optimal procedure for the likelihood 
calculation is now described. Future observations depend on 
past observations conditioned on the state of the FSM 
because of the presence of the parameter process {g k } . 

However, by conditioning on the parameter g k as well, 



P(2 k 0 , X* ) = P(2 k | Z*-' , X* )P( Xk )P(Z*- 1 , X*-' ) 

= N(z k ;q T k gk\k- 1; N 0 +q T k Gk\k- \q k ) 
xP{x k )P(z k Q -\xt- x ) 



(10) 



where N(z;m;er 2 ) denotes the probability density 



while g k \ k _ x and G k 



are the channel one -step prediction and 
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separation of the future and past observations occurs, 
yielding equation (11) . 



The relation in equation (11) and subsequent analogous 
expressions are the basis for the practical techniques 
proposed in the next section. It indicates that the 
likelihood may be split into three factors, of which the 
first two depend each on the past/present and future, 
respectively. The third may be viewed as a weighting factor 
that binds them together. Thus, the third factor quantifies 
the dependence of the future, present and past that is 
introduced due to the parameter process {g k } and in the 

absence of parametric uncertainty would be eliminated. An 
alternative interpretation may be offered by realizing that 
the expression in equation (11) is closely related to the 
total mean square error of a sequence-conditioned Kalman 
smoother. A closed- form expression may be found for the 
binding factor since it involves an integral of Gaussian 
densities. The first factor in equation (11) is recursively 
evaluated using equation (10) , while the second factor is 
calculated through a similar backward recursion. 



P(z n 0 ,x" 0 ) = P(z k 0 , x k 0 )P(z" M , x" k+l I s M ) I 



ng k \x k 0 ,z k )P(g k \s k+i ,x n M ,z" k+l ) 

P(g k ) 



dg k 



(11) 
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(12) 



~ N(z k+ i ; q k+ i gk+\\k+2 1 N 0 + q k+l G k+ ^ k+2 q k+} ) 



xP(x k+l )P(z i 




The scheme suggested by equations (10) - (12) is 
illustrated in FIG. 4 and may be described as follows. 
Starting at time 0, a forward N x -ary tree is built, each 

node of which represents a sequence path. The likelihood 



stored in each node. At each time k, the tree is expanded 

forward and the probabilities corresponding to the newly 
branches are calculated using equation (10) . It is implied 
from this equation that a KF that depends on the entire path 
history is required to complete the recursion. Similarly, 
starting at time n, a backward tree is expanding according 

to the recursion equation (12) . The relevant channel 
estimates are provided by a per-path backward running KF. 
After k+1 forward and n-k backward steps, the two trees meet 

each other. The likelihood of each sequence x% may now be 
evaluated as indicated by equation (11) . The (N x ) k+1 

likelihoods corresponding to the nodes of the forward tree 
are combined with the (N x ) n ' k likelihoods corresponding to 

the nodes of the backward tree (future) and weighted by the 



P&o \x kX ), together with g k \ k _ x and G k [ 



of that path are 
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binding factor in equation (20) . The final soft output for 
a generic quantity u m is the summation (or maximization) 

over all factors with the same u m . Note that the choice of 

k, the particular point in time when the past and future 

metrics are combined, is completely arbitrary (i.e., it is 

not related to m) . In fact, the two extreme values k=n and 

k=0 correspond to a single forward or a single backward 

tree. In a practical technique, however, the reference 
point k is chosen to be in the neighborhood of m, in order 

to maximize the number of relevant sequences combined to 
produce the soft information on u m . Thus, while it may seem 

redundant to store and update both a forward and a backward 
tree (i.e., same result can be accomplished with a single 
forward tree) , the fact that the two trees may be pruned* 
independently, decouples complexity and observation length, 
leading to practical techniques, as will be discussed below. 

B. Sequence -First Combining 

The special form of APP p (u^) allows us to obtain 

alternative expressions for the optimal soft outputs by 
realizing that the expectation operators in equation (9a) 

may be interchanged, to obtain APP p (u k ) = cPQ 9 u k ) . In 
particular, a straightforward expression for APP p (tJ may be 
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derived by utilizing the fact that the process {(t^gr*)} is 

a mixed-state Markov chain. Unfortunately, the storage 
requirement for these recursions may be infinite due to the 
fact that g k takes values in a continuous space. Although 

5 it is conceivable to quantize the channel values, another 

approach may be followed. A derivation similar to equation 
(11) leads to equation (13) . 

^ ^ y 

M-) 

The forward and backward recursions for the first two 
10 quantities are as follows: 

Aside from the evident similarity of equations (13) and 
(14) with equations (11), (10), and (12), there are two 
important differences as follows: 

xP(z k \t k ,z k 0 - l ,x k ) 

P( Z k+l\ S k+l) = 5j P( Z k+l I h+\> Z k+l) 

(lib) 

xP(x k+l )P(z n k+2 \s k+2 ) 

is 1) the recursions described here do not depend (at 

least explicitly) on the entire path history and 

2) the evaluation of the third factor of equation (13) 
as well as the innovation factors in equation (14) is 
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complicated due to the fact that they are mixed-Gaussian 
densities. Nevertheless, assuming that the latter 
difficulty may be overcome, the technique suggested by 
equation (13) and (14) is much simpler: only a forward and 
5 backward recursion is performed over a state trellis, 

followed by a combining (multiplication) of the updated 
quantities with an appropriate weight (third factor) . This 
procedure is depicted in FIG. 5. The generalized states 

^and transitions t s k may be used with the corresponding 
10 updating equations unchanged. 

C. Comments on the Deterministic Parameter Model 

In the case when the unknown parameter is modeled as a 
deterministic constant, and expectation over the unknown 0 
is is not feasible, a reasonable soft output choice is 

APP d {u k ) = c2>axP(z 0 ^ 0 " |0) (15) 



0 



MSM d (u h )^c'-\o% 



maxmaxPOroX |0) 

4:u k 0 



(16) 



The development of the exact expressions for this 
modeling option is similar to that associated with the GM 
20 channel. The resulting expressions are structurally similar 
with the main difference being the channel estimator, which 
is a recursive least-squares (RLS) estimator instead of the 
KF. Similarly to the GM case, by exchanging the order of 
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maximization in equation (16) , sequence-first expressions 
may be developed as well. 

SUB -OPTIMAL (FIXED -COMPLEXITY) ALGORITHMS 

The exact evaluation of the soft metrics developed 
above involves likelihood updates on a forward and backward 
tree, assisted by per-path filters, followed by binding of 
the past and future metrics. Accordingly, any sub-optimal 
technique for the case of parameter- first combining may be 
interpreted as the result of applying one or more of the 
following simplifications : 

1) non- exhaustive tree search; 

2) non-Kalman channel estimators; and 

3) sub-optimal binding of the past and future metrics. 
Similarly, for the case of sequence- first combining, any 
sub-optimal technique is the result of a simplifying 
assumption for the innovation factors, as well as a simpler 
form for the channel estimators and binding factor in 
equation (13) . This design space is partially explored 
below. 

A. Parameter-First Combining 

1) Tree-Search Techniques: Regarding the tree search, 

many options are available to prune the sequence tree. 
Since breadth- first schemes maintain a common front in the 
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search process, the schemes. may be appropriate for soft- 
decisions. This facilitates the combining task. One such 
technique is the VA, which maintains and updates -through 
the familiar add compare select (ACS) operations- a fixed 
number of paths in such a way that they are forced to have 
different recent paths. A technique for evaluating MSM ( • ) 
metrics, proceeds by extending and eliminating paths in the 
same way as in the hard-decision case, while the completion 
is performed by minimizing the corresponding transition 
metrics. The formulation of a practical technique for 
calculating APP ( - ) metrics involves summation of the 
sequence metrics as well as tree pruning. A technique that 
combines these two tasks may be derived employing either the 
PSP principle, or equivalently, the decision-feedback (DF) 
assumption. 

The resulting FI techniques, shown in FIG. 6, include a 
single forward and backward recursion over the entire 
observation record, similar to the ones performed in the 
classical SISO. Product sum or ACS operations are performed 
for the metric updates, for APP or MSM soft metrics, 
respectively. A KF channel estimate is kept for every 
trellis state and updated in a PSP fashion. The soft 
outputs for x k and y k are derived from the soft output of 

the transition t k . The latter is computed as the product 

(sum) of the forward metric of the starting state s k , the 
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transition metric of t*, the backward metric of the ending 

state s k+1 , and the binding factor corresponding to t*. 

However, the trellis on which this technique operates is not 
tightly related to the FSM trellis. The trellis size is a 
design parameter that determines the amount of pruning in 
the forward and backward trees, and eventually, the 
complexity of the technique. 

2) Channel Estimate and Binding Factor Simplification: 

Any near-optimal receiver has to search over as many paths 
as possible for a given amount of resources, so it is 
desirable to reduce the complexity associated with the 
metric updates and in particular the channel estimates. One 
such simplification is to substitute KF channel estimation 
with the least mean-squares (LMS) technique, so no matrix 
storage and update is required. This simplification may be 
derived in a more rigorous manner for the case of 
deterministic parameter model, resulting in a simple and 
insightful expression for the binding factor shown in 
equation (17) . 



The above expression may be interpreted as follows. If 
the forward and backward channel estimates corresponding to 
a particular sequence are not consistent, a penalty is paid 
by means of increasing the sequence metric. 



(17) 
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B. Sequence-First Combining 

1) Metric Simplification : Starting from equation (14) , 

sub-optimal techniques may be derived by employing a 
5 simplifying assumption for the innovation factors 

p ( z k K> 2 o _1 ) / P( z k + \ l r jt+P z Jfc+2) • The Gaussian approximation for the 
above innovation terms leads to an attractive technique 
since only the state -conditioned/ sequence -averaged forward 

(i.e., gk\k-\( s k)- E(Sk \ s k> z l~ X ) ) and backward channel one-step 
10 predictions together with the corresponding covariances need 
to be maintained and updated. Note that these estimates are 
only partially conditioned on the data sequence through the 

state s k (or more generally the super-state s s k ) . Recursive 

update equations for these partially conditioned (PC) 
is channel estimates are very similar to the KF recursions, 
thus we use the name PCKF. Furthermore, in the limiting 
case when the super-state represents the entire sequence, 
the innovation factors become precisely Gaussian and the 
PCKF becomes the sequence -conditioned KF; this is the exact 
20 scenario of the parameter-first combining in the GM case. 

Under the Gaussian assumption, a closed- form expression for 
the binding factor in equation (13) may be derived as well, 
resulting in a function similar to b p (.). 
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2) Further Channel Estimator Simplification: In 

addition to the Gaussian approximation, a further 
simplification occurs under the assumption that the 
conditional means and covariances of. the channel are not 

5 functions of the states E(g k \ ^,z 0 w ) « E(g k | z*" 1 ) = g k{k _ x . This 

approximation may result in a desirable solution, since only 
a single forward and a single backward global estimator 
(aver aged over the sequence) needs to be maintained and 
updated. Assuming that a probabilistic description P'(t k ) is 
10 available for the transitions t kl a recursion can be derived 

for g k \ k „ x . The application of this single-estimator idea is 
inhibited, since 1) the above approximation is not valid and 
2) an accurate P\t k ) may only be derived from the 

observation z\ and is therefore tightly coupled with the 
is estimation process. Both 1) and 2) are alleviated by 

introducing a delayed (advanced) by d channel estimate to 

evaluate the forward (backward) transition metric at time k, 
since by increasing the decision delay d, the accuracy of 
the approximation 



E{g k .AhX- d - X )^E{g k ^A- d - x ) 



(18) 



20 



— Sk-d\k-d-\ 



is improved. The resulting recursion equations, 



summarized in the Appendix, closely resemble those of the 
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KF. The intuitive justification of this technique is that 
since a probabilistic description of ^. d --and consequently 

>> w --exists, an average q k ^. l =^t k _ d q k _ d P'(t k . d ) may be used in 

place of q k , d in the KF recursions, thus resulting in what we 

5 refer to as an average KF (AKF) . The resulting A-SISO, that 
utilizes a d-lag (d-advanced) soft-decision-directed forward 

(backward) AKF, is depicted in FIG. 6, and proceeds as 
follows. The forward metrics at time k are updated as in 

equation (14a) using the d-delayed channel estimate g k _ dlk _ d ^ . 

10 Starting at time k, a d-step non-adaptive backward recursion 
is performed, at the end of which, a smoothed soft metric 
(P\t k _ d ) = P(t k _ d \z k 0 ) is obtained. The latter is now used in the 
AKF to update g k . dlk , d ^ . A similar one-step adaptive 
backward/ d-step non-adaptive forward recursion is required 

is for the update of the backward quantities. 

TCM IN INTERLEAVED FREQUENCY -SELECTIVE FADING CHANNELS 
A. Receiver Structures 



As mentioned above, the TCM system may be modeled as a 



20 



serial concatenation of two FSMs - the outer TCM encoder and 



the inner ISI channel - through the interleaver. 
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Three receiver types may be identified for the case of 
perfect CSI. They included the traditional hard-decision 
Viterbi equalizer (VE) followed by a Viterbi decoder (VD) , 
as well as the more sophisticated iterative structure shown 
in FIG. 7. An adaptive receiver may be derived in a 
straightforward way from the non-adaptive version, by 
replacing the inner detector (i.e., the equalizer) with its 
adaptive equivalent, while leaving the outer detector (i.e., 
the decoder) intact . 

Although there are many possible A-SISOs arising from 
the framework in above section for the soft metrics, only 
trellis-based techniques are utilized. Several notes on the 
details of the implementation follow. 

1) APP techniques operating in the log domain, result 
in a small complexity increase compared to MSM. 
Furthermore, all APP techniques may be constructed from 
their MSM counterparts by replacing the function in the ACS 
operation by min*=min (x,y) -log (1+exp ( - | x-y\ ) ) . 

2) Trellis-based multiple-estimator structures store 
and update one estimator per state with zero delay, while 
single estimator schemes require d backward steps- for every 

forward step- to provide reliable tentative soft or hard 
data estimates to update their single estimator. 
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3) Regarding the particular channel estimator used, the 
complexity increases in the order LMS, RLS, KF, AKF, PCKF, 



4) Optimal binding is, in general, a costly operation 
(see equations (20) and (21) in the Appendix), while the 
sub-optimal binding proposed in equation (17) results in a 
small increase in the adaptive SISO complexity. 

5) Forward- only techniques have significantly lower 
requirements in computation and memory than forward/backward 
techniques with the same number of states, since they do not 
require the additional backward recursion and binding. As 
was discussed above, however, the exponential dependence of 
complexity and smoothing depth is expected to give rise to 
much higher overall requirements for forward-only 
techniques, if the performance of forward/backward 
techniques is to be obtained. 

B. Numerical Results and Discussion 

Simulations were run for a transmission scheme 
comparable to GSM. The convolutionally encoded sequence is 
interleaved using a 57x3 0 block interleaver. Each 
interleaver column is formatted into a TDMA burst together 
with a training sequence, equally split in 13 leading and 13 
trailing symbols. Each burst is modulated and sent over a 
three-tap equal power Rayleigh fading channel (each tap is 



with the KF and the AKF having almost equal complexity. 
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assumed independent from the others) with normalized Doppler 
spread v d =0.005. Referring back to the generic model in 

equation (4b) , the above-described scenario corresponds to a 
system with root-raised cosine pulses, symbol-spaced 
independent fading taps (i.e., T r -T) and a whitened-matched- 

filter symbol-spaced (i.e., T S =T) receiver FE. Although the 

decorrelation time of such a channel is much larger than 57 
symbols, for the purpose of simulation efficiency, a smaller 
interleaver depth is used in conjunction with the assumption 
of burst-to-burst independent channel. 

Three systems are considered as follows: i) a rate 1/2, 
16-state coded QPSK system (SI); ii) a rate 2/3, 32-state 
coded 8PSK system (S2) ; iii) an uncoded QPSK system (S3) . 
Regarding the naming of the presented techniques, each 



technique is identified by a four-part label, each part of 
which denoting: 1) the type of the soft decision (i.e., APP 
or MSM) ; 2) the multiplicity of the channel estimators 
(i.e., SING or MULT); 3) the particular channel estimator 
used (i.e., KF, RLS, LMS, AKF) ; and 4) the binding method 
(i.e., optimal binding (OB), sub-optimal binding (SB), or no 
binding (NB) ) . The trellis size of all techniques 
considered here is chosen to be the same as the size of the 
underlying FSM trellis. 

FIG. 8 presents performance curves for system SI, 
employing the iterative receiver described in the previous 
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section with different A-SISOs for the inner equalizer. 
Bit-error rate (BER) curves for the first and fifth 
iteration are shown; no significant improvement was observed 
for more than five iterations. For the A-SISOs employing KF 
or AKF, the channel estimators were obtained by 
approximating the Clarke spectrum with a first order model 
having 10 -dB bandwidth equal to v d . Comparing the two 

curves corresponding to MSM-MULT-LMS, a loss of 2 dB (1 dB) 
is observed for the fifth (first) iteration when no binding 
is performed. This outcome clearly indicates the 
significant practical value of the binding factor. The 
comparison between MSM - MULT - LMS - SB and MSM - MULT - KF - OB shows 
that LMS channel estimation with sub-optimal binding is 
nearly as good as the KF with optimal- and computationally 
expensive -binding. In the first iteration, the latter 
performs slightly better (by 0.7 dB at BER=10" 3 ) , while in 
the fifth iteration, no notable difference is observed. 
Multiple-estimator schemes are shown to be 2- 4 dB better 
than single-estimator counterparts in the first iteration, 
while this gain is decreased to 0.5- 2 dB after the fifth 
iteration as can be observed from the comparison of MSM- 
MULT-LMS-SB and MSM - MULT -KF- OB with MSM-SING-LMS-SB or APP- 
SING-AKF-OB. Note that the optimal value for the tentative 
delay was found to be d=3 for both SING estimators. The 

best A-SISO achieves performance that is just 1 dB away from 
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that of perfect CSI. Regarding the iteration gain, as much 
as 6- 7 dB can be gained using five iterations for both 
single-or multiple-estimator SISOs. This result is the 
direct antithesis with the perfect CSI case, where an 
iteration gain of only 1 dB does not even justify the need 
for ID. Finally, simulation results confirm the negligible 
difference between APP and MSM techniques for these 
operational SNRs. 

In FIG. 9, the performance of MSM - MULT - LMS - SB of FIG. 5 
is compared with that of the corresponding receiver 
employing a forward-only A-SISO with decision delays and 
symbols. Other than the different inner A-SISOs, all other 
components of the compared receivers are identical. As 
expected, performance is improved by increasing the 
smoothing depth D, but gives rise to exponential complexity 

growth. The comparison with the proposed A-SISO shows that 
even with a high complexity forward-only technique (D=5 

corresponds to a 1024-state trellis) a performance gain of 
1-1.5 dB may be achieved with the FI A-SISO with only a 
fraction of the complexity (a forward and a backward 
recursion on a 16-state trellis is required) . 

Similar performance curves are reproduced in FIG. 10 
for system S2 over the same channel as in the previous 
simulation. The presence of the denser 8-PSK constellation 
produces quantitatively different performance curves. 
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Single -estimator schemes reach an error floor at BER values 
greater than 10~ 2 , regardless of the channel estimator used 
(i.e., LMS or AKF) . Multiple estimator techniques using 
either KF and OB or LMS and SB perform almost identically at 
5 BERs smaller than 10" 2 . Both of these adaptive techniques 
yield much worse performance compared to perfect CSI (the 
loss is on the order of 5 dB for the fifth iteration for the 
best A-SISO at BER of 10" 3 , while is reduced to 
approximately 3 dB for a BER of 10" 5 ) . 

io Coded modulation techniques have been considered as a 

method to provide improved performance (i.e., coding gain) 
with the only cost being increased receiver complexity 
(i.e., no bandwidth expansion). The design tradeoffs for 
this frequency selective channel are more complex than those 

is for an ideal AWGN channel . Those tradeoffs may be studied 
under the perfect CSI assumption. FIG . 11 presents a 
comparison between systems S3 (uncoded QPSK) and S2 (8PSK- 
TCM) , both having the same throughput and occupying the same 
bandwidth. In the AWGN channel, S2 provides a 4.6-dB gain 

20 over the uncoded system. Conclusions are obtained for the 

case of perfect CSI: coding gain without bandwidth expansion 
is not possible using hard-decision receivers. The 
utilization of soft-decision receivers results in 4-dB 
coding gain at a BER of 10" 3 for the first iteration. 

25 Additional iterations slightly improve the performance, 
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resulting in 5.5-dB gain at the fifth iteration. When 
perfect CSI is not available, and adaptive processing is 
performed, the hard-decision PSP receiver still cannot 
provide any performance improvement over the uncoded system. 
Furthermore, the adaptive soft -decision techniques, provides 
a poor coding gain when only a single iteration is performed 
(i.e., 3.5 dB) . However, the use of iterative soft-decision 
adaptive processing results in a gain of approximately 13 
dB. 



C. Factors Impacting Performance 

The conclusions drawn in the previous section are 
tightly coupled with the particular channel conditions and 
system configuration, and may be significantly altered when 
different operating conditions are considered. One channel 
characteristic, that has a significant effect on receiver 
design, is the level of dynamics (measured by the normalized 
Doppler spread v d ) . While high dynamics were considered 

here, in the case of low dynamics, the need for adaptive 
processing is questionable; an initial channel estimate may 
suffice for use in conjunction with a non-adaptive iterative 
detector. Similar conclusions have been drawn for adaptive 
hard-decision techniques. 

The signaling format, and in particular the 
configuration of the training sequence, is another system 
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characteristic that has a great impact on receiver design. 
When only a leading training sequence is available, a more 
reasonable choice is to use an FL A-SISO. The design of an 
FL A-SISO is not a trivial extension of the FI scenario 
presented here, and is a topic of current research. 

Regarding tracking versus acquisition operating mode, a 
relevant measure is the product of the payload (i.e., burst) 
size J with the normalized Doppler spread of the channel v d 

(P=Jxv d ) ; the smaller the value of P, the lower the 

probability of loosing lock. For systems operating with 
small P values, and utilizing leading/trailing training, a 

low complexity non-adaptive SISO technique derives channel 
estimates by linear interpolation between the initial and 
final channel estimates. In FIG. 12, the performance of 
this scheme is compared with that of MSM-MULT-LMS-SB for 
system SI. It is shown that the interpolator based non- 
adaptive SISO operates with 1-dB degradation compared to 
MSM - MULT - LMS - SB for a BER of 10" 3 and payload size J=57 . 

Unfortunately, such high-performance/low-complexity A-SISO 
is not feasible when either a trailing training sequence is 
unavailable or when the value of is increased. The latter 
is demonstrated in FIG. 12, where the doubling of payload 
size results in catastrophic performance for the 
interpolator based SISO. 
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FIG. 17 illustrates a method for estimating inputs and 
outputs of a digital transmission system. The method 
includes receiving an observed digital signal in the digital 
transmission system (at 1700) , generating a first sequence 
of soft information by performing a forward recursion (at 
1702) , and estimating channel parameters using the first 
sequence of soft information and the observed digital signal 
(at 1704) . The method also includes generating a second 
sequence of soft information by performing a backward 
recursion, at 1706, and estimating channel parameters using 
the second sequence of soft information and the observed 
digital signal, at 1708. A transitional information that 
ties forward and backward estimates of the first and second 
sequences of soft information together is computed at 1710. 
At 1712, soft information on the inputs and outputs of the 
transmission system is generated by combining updated first 
and second sequences of soft information and the 
transitional information. 

ADAPTIVE SOFT DEMODULATOR 

An Adaptive -Soft Demodulator (A-SODEM) technique 
provides an alternative approach to the A-SISO technique in 
building soft inverse blocks. In fact, both are special 
cases of the adaptive soft inverse with the A-SODEM term 
used, for a single state A-SISO. Under this technique, the 
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estimation process is tied to the detection process through 
the exchange of soft or hard information between the A-SODEM 
and the rest of the ID network, as shown in FIG. 1C and 2C, 
for PCCC and SCCC, respectively. 

In this section, the above approaches to Adaptive 
Iterative Detection (AID) for phase tracking in PCCC 
and SCCC systems are considered. A key step is the 
development of the appropriate Adaptive Soft Inverse 
(ASI) techniques for phase uncertainty. Initially, 
recursions for the exact ASI for an FSM with 
observation that depends linearly on an unknown 
parameter vector is derived. Approximate ASI 
techniques for this problem is then motivated. The 
techniques have the form of an adaptive forward 
backward technique executed on a fixed-size trellis. 
ASI modules for phase tracking may be defined. This 
may yield practical A-SISO and A-SODEM techniques for 
phase tracking that are used as described above. 
Finally, these techniques for AID for phase tracking 
in SCCC and PCCC are applied. The results demonstrate 
that the substantial gains in performance associated 
with turbo codes relative to traditional convolutional 
coding are maintained in the presence of severe phase 
dynamics when a well -designed AID scheme is used. 
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THEORETICAL FRAMEWORK FOR AST TECHNIQUES 

Consider a system with input x k e A x ={0X-..,N X -1} , 
and output at time k observed indirectly, through a 

function which also involves an unknown vector- 
parameter gr. Furthermore, assume that the observation 

z k is a linear function of g: 



where n k is complex circular Additive White Gaussian 
Noise (AWGN) with E{\n k \ 2 } = N Qf and q k =f(x n 0 ) is a size L 

complex vector depending on the modulation format, drawn 
from a constellation of size Q, and possibly depending on 

the entire input sequence Xq . 

Based on the sort of information exchanged, a soft 
inverse technique can be classified as a posteriori 
probability (APP) or Minimum Sequence Metric (MSM) . 
Whenever perfect CSI is not available at the receiver, 
similar quantities may be evaluated by either averaging, or 
maximizing over the unknown parameter. For a generic 
quantity u kf and assuming that g is modeled as a 

deterministic unknown constant vector, we define the APP and 
MSM ASI as a system that outputs 



(19) 
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def 



APP(u k )^cXmaxP(z n 09 x n 0 \g) 



(20a) & (20b) 



MSM(u k ) = c'- log max max P(z n Q , x n 0 \ g) 



4-.u k g 



where x%:u k denotes all sequences jcJ consistent with u*, 

and c,c' are normalizing constants. These soft outputs are 
direct generalizations of the well-known perfect-CSI soft 
outputs for the case of unknown deterministic parameters. 
In substantially all cases, the soft outputs may be derived 

from the quantity max g P(Zq, Xq \ g) by either averaging or 
maximizing - for APP(-) or MSM(-) respectively - over the 
nuisance parameters jCqIW^. 

A . Exact evaluation of the AST for FSM: A-SISO technique 

In this section, the derivation of the exact ASI for an 
FSM with observation that is a linear function of an unknown 
parameter vector is summarized. Specifically, bi- 
directional recursions for the quantities in equation (20) 
may be obtained. 

Consider an FSM with input x k , state s k - together 

constituting the transition t k ={s k ,x k ) - and output y k , where 
each integer quantity u k (i.e., x kf y kf s kf or t k ) is assumed 
to take values in the set . The observation equation is 
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given by equation (19) specialized to q k = f(x k ,s k ) = f(y k ) , where 
q k depends on the input sequence only through the FSM output 

For the evaluation of max g P(zq ,Xq \ g) (the summation/ 

maximization over x^\u k is done in a later stage), we choose 
to work in the log domain with the metric 

-logP(z 0 ",< I g) = -\ogP(z" 0 I x n 0 ,g)P(x" 0 ) = -logn jV(z m ;^g;iV 0 )/7(xJ 

_ t a (2D 

where N(z;m;G 2 ) denotes the probability density 

function of a complex circular Gaussian random variable with 
mean m, and variance a 2 /2 for the real and imaginary parts. 

The constant term c" is dropped and the metric is slightly 

generalized by introducing an exponentially decaying window 
relative to time k with forgetting factor p. This weighting 

provides increased numerical stability as well as the 
ability to track slow parameter variations. Maximization 
over the unknown parameter gr, required in equation (20) , is 

equivalent to 

\z m -<l T mg\ 2 



A(z 0 \*S) = min|2; 



N 0 



logP(x m ) 



m=k+\ 



l*« q » 8 \ 2 -\ogP(x m ) 



N 0 



P 



(22) 



10 



15 
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The key to deriving the exact expression for A(z^;Xq) is 
the fact that the solution g to the least squares 
minimization problem in equation (22) may be written in 
terms of a forward g k and a backward g b M estimate, depending 

on the observation z\ and z n k+x , respectively. The final 
result is summarized in the following set of equations 

(23a) 

(23b) 



A(^;^) = A(4;^) + M*(^,^ + ,,^,) + 6(-) 



T x |2 



A(z k 0 ;x k 0 ) = pA(z k 0 ->;4- l )-\o f >P(x k ) + f 



A (4 + i>s M ,x" M ) = pA (z" k+2 ;s M ,x n k+2 )-\ogP(x k+] )+ 



p \z k «-qLgL\ a*;) 



pWkJLifi M n 0 



where g k ,g b k+i ,P k and P k \ x are the forward and backward 
Recursive Least Squares (RLS) estimates and corresponding 
information matrices, respectively. Both the forward and 
backward RLS estimates defined above are implicit functions 

of the input sequence x% and x n M , respectively. Finally, 
the binding term £>(•) is given by 1 

(23d) 



1 










2 


^0 








gk+gc 





gc=~P k 



p +—p b 

r k ^ r k+l 

p 



-i-l 



g k --pL 
p 



r k ^ r fc+\ 



gk+\ 



(23e) 



where \\x\\ A =x*Ax,x* denotes complex conjugate and 
transpose, and g c is the smoothed Least Squares estimate 
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Note that the recursions in equation (23) , which is a 
novel way of calculating the residual metric of a RLS 
smoother, are very similar to those of the standard forward- 
backward SISO technique, with the exception of the existence 
of a binding term in equation (23d) . However, the forward 

recursion depends on the entire path history %l and the 

backward recursion depends on all the entire conditioned 

future ^ +1? x£ +1 . Thus, exhaustive combining and 

marginalization is required to compute the ASI (i.e., 

equation (20)) when parametric uncertainty is present. 
However, the bi-direction form of the recursions in equation 
(23) is useful for motivation of practical approximations. 

B. Exact evaluation of the ASI for a memory-less mapping: 

The A-SODEM technique 

The ASI of the subsystem including the modulator and 
the channel, namely the A-SODEM, may now be derived. The 
starting point of the derivation of such a technique is the 
observation equation (19) , specialized to g k =f(x k ) . Note 

that this is a special case of the FSM model assumed above. 
Thus, the results in equation (23) are applicable. Note 
that, even though the mapping is memory-less, the ASI 
requires memory (i.e., exhaustive combining and 
marginalization) due to unknown parameter gr. 
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PRACTICAL ASI TECHNIQUES FOR PHASE TRACKING IN TURBO CODES 

As may be seen from FIGS. 1A and 2A, the encoded 
complex symbols q k are transmitted to an AWGN channel, which 

introduces phase offset 6 k , as well, resulting in the 
following complex baseband model 



where E el is the symbol energy, and the symbols are 
normalized to unit energy. 

A. Practical A- SI SO techniques 

It was demonstrated above that the main components of 
the optimal A-SISO technique described in equation (23) for 
the specific modeling assumption for the unknown parameter 
are: (i) the metric update mechanism on a forward and a 
backward tree, (ii) the sequence conditioned forward and 
backward RLS estimators, and (iii) the binding of the 
forward and backward metrics. Since each of these 
components is either associated with extreme complexity, or 
is tightly related to the specific model, several options 
are discussed for the design of practical A-SISOs. 

Regarding the complexity associated with the tree 
search, among the possible solutions, one desirable sub- 
optimal solution emerges when force-folding the tree into a 




(24) 
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fixed complexity trellis, and applying the Viterbi Algorithm 
(VA) to update the metrics. In addition, the Per-Survivor 
Processing (PSP) principle, or the Decision Feedback (DF) 
assumption may be applied as a mechanism to update the 
parameter estimates. Furthermore, the RLS per-path 
estimators may be substituted by per-state generic parameter 
estimators, suited for the specific parameter model. 

The resulting technique, shown in FIG. 6, and described 
in Appendix B, include forward and backward recursions 
similar to the ones performed in the forward-backward SISO 
for perfect CSI, aided by multiple (i.e., one for each 
state) parameter estimators. Each recursion requires Add- 
Compare-Select (ACS) or Product-Sum (PS) operations to be 
performed on a state trellis for MSM and APP soft metrics, 
respectively, as well as parameter updates. 

The trellis on which this technique operates is not 
tightly related to the FSM trellis. In fact, the notion of 
the state s k and transition t k may be generalized to longer 

sequence portions leading to expressions identical to 
equation (23) . As an example, a superstate and super- 
transition may be defined as s s k = {t k _ K9 ... 9 t k _ X9 s k ) and t s k =(t k _ K ,... 9 t k ) 
for arbitrary K. The term K is a design parameter that 

determines the amount of pruning in the forward and backward 
trees, and eventually, the complexity of the technique. 
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This technique has been used extensively in the problem of 
data detection in the presence of unknown parameters. At 
the limit, when K is equal to the entire sequence length n, 

the corresponding A-SISO exactly evaluates the APP, MSM 
5 metrics in equations (20a) and (20b) . 

By inducing a delay (advance) in the decision feedback 
of the forward (backward) recursion, single estimator A- 
SISOs may be derived resulting in the structure shown in 
FIG. 6. Forward and backward recursions are performed on 
10 the FSM trellis, aided by a single forward and backward 

parameter estimator. The required tentative hard or soft d- 

delayed (d-advanced) decisions are provided by either a d- 

step hard-decision trace back, or a d-step backward 

(forward), non-adaptive recursion on the state trellis. 
15 There are two ways that the framework established in 

Theoretical Framework Section to motivate practical ASI 
modules for the case of the phase tracking may be used. One 
approach is to modify the observation equation in equation 
(24) to obtain a linear model as in equation (19) using 

20 g~e j6k , where g is a complex amplitude parameter. With this 

modification, instead of estimating the physical parameter 

9 k , the complex amplitude gr is estimated. Another approach 

is to maintain the non-linear observation equation and 
replace the parameter estimators in the A-SISO or A-SODEM by 
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some non- linear estimator. For example, an Extended Kalman 
Filter (EKF) or a Phase-Locked Loop (PLL) may be used. In 
the present disclosure, the latter approach is considered. 
Regarding the channel estimator, a simple first order 
Decision Directed PLL (DD-PLL) is used in place of the RLS 
technique with noise equivalent bandwidth (normalized to the 
symbol time) B eq ~ A/(4-2A) , 

e k =9 k _^n{z k q k e- j ^) (2 5) 

and similarly for the backward estimate 0% +l . 

In addition, it is desirable to reduce the complexity 
in the forward equation (23b) , backward equation (23c) , and 
update equation (23d) . One such simplification leads to the 
following approximations 

\z k -q k e A, \ 2 



A(z*; <) = A(z*-' ;<-)- log P(x k ) + J 



N 0 (26a) 



A A (z; +1 ;^ +1 ,< + ,) = A i (z; +2 ;^ +2 ,< +2 )-logP(x, +1 )+ 1 ^ 1 q *f** (26b) 
\-X \e J§l -e M \ 2 (26c) 



b(0 k ,6° k ) = - 



X{2-X) N Q 

We note that this last equation is not exact; it was 
obtained intuitively by observing the properties of equation 
(23d) . The above equation provides additional insight on 
the role of the binding term: If the forward and backward 
channel estimates corresponding to a particular sequence are 
not consistent, a penalty is paid by means of increasing the 
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sequence metric. Furthermore, this penalty is amplified 
when tracking slowly changing parameters (X close to 0). 

B. Practical A-SODEM techniques 

Having establish the parallel between A-SODEM and A- 
SISO, namely that the A-SODEM is an A-SISO operating on a 
specially defined FSM, simplifications discussed in the 
previous section may be applied in this case as well. In 

particular, defining the super-state s s k ={x k _ K ,...,x k _ x ) f a family 

of practical techniques may be derived in a straightforward 
way having structure similar to the A-SISOs shown in FIG. 6, 
including forward and backward processing with per- state 
parameter estimation. 

The simplest case of K = 0 may be of particular 
interest, since this will provide useful links with existing 
adaptive techniques. Under this simplifying assumption, the 
FSM trellis reduces to a single-state trellis with N x 

parallel transitions. The A-SODEM technique is described by 
a set of forward/backward and binding equations, where the 

dependency of the forward A(z£) and backward A b (z k+} ) metrics 

on the input sequence {x k } may be dropped. Furthermore, 

concentrating on the forward recursions, the values for - 

or equivalently x^ - required in equation (25) by the DD-PLL 
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to update 6 k , are provided by instantaneous decision 



feedback based on the current metrics A(z*) - The resulting 
family of A-SODEMs is described in Appendix C. 

Different variations of the technique described in 
Appendix C (i.e., no binding and forward-only recursions are 
such variations) may also be envisioned. An additional 
option is to hard quantize the new a-priori information 
P(x k ) to generate hard estimates x k (or equivalently q k ) . In 

this case, the metric update step 1) in Appendix C 
simplifies to 

Metric update : a k+l =a k + \z k -q k e j3k - 1 f 

and q k is used in the DD-PLL to update 0 k . Similar 

simplifications are performed in the backward step 2) . If 

the a-priori information P(x k ) is not utilized at all, the 

above phase estimation technique is a non- iterative scheme 
which needs to be activated only once, before the iterative 
decoding process . 

SCCC WITH CARRIER PHASE TRACKING 

As shown in FIG. 2A, in a SCCC the sequence of source 
bits Jb n , is partitioned into blocks and convolutionally 

encoded using a rate R 0 outer CC, producing N coded symbols 

x n . These symbols are fed to an inner CC of rate R ± through 
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a pseudorandom symbol interleaver of length N. The output 
symbols y k are mapped onto a constellation of size Q, 

resulting in an overall code rate of R = R 0 R i log 2 Q (bits per 
channel use) . The complex symbols q k are transmitted 

5 through an AWGN channel, resulting in the complex baseband 
model of equation (24) . 

The effectiveness of the adaptive iterative detection 
technique may be assessed by a number of factors . For 
example, loss of lock probability, tracking bandwidth and 
10 BER in the tracking mode are all relevant performance 

measures. Initial experiments suggested that cycle slipping 
was a major performance limiting factor. This is because 
the operating SNR is very low and the block length 
(interleaver size) is large. Thus, the insertion of pilot 
is symbols may be considered. 

In particular, N t pilot symbols are inserted in the 

transmitted sequence for every N d coded symbols. The energy 

lost in the redundant pilot symbols is accounted for by 
lowering the transmitted symbol energy as 



20 



E^RR.E^R^log.Q 



N d + N t 



(27) 



where E b is the energy per information bit. 
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A. Receivers 

The structure of a SCCC is one of a serial 
concatenation of two FSMs through an interleaver and 
therefore it permits the iterative receiver shown in FIG. 2B 
for the case of perfect CSI. Similarly, the block diagram 
of the iterative receiver that utilizes an A-SODEM and inner 
and outer non-adaptive SISOs is shown in FIG. 2C, while the 
A-SISO based receiver is shown in FIG. 2D. 

Utilizing the DD-PLL estimator of equation (25) , the 
forward/backward equations, and the binding term in equation 
(26) , A-SISO-or A-SODEM-based receivers are constructed and 
their performance under different operating scenarios is 
simulated in the next section. A baseline adaptive decoder, 
that may be derived for PCCCs, is also considered. It 
includes a single DD-PLL which uses decisions on the raw 
output symbols g k , as well as the pilot symbols, to obtain a 

phase estimate and consequently derotate the observation; 
however, no feedback information on y k from the inner SI SO 

is utilized. A standard iterative decoder is then employed 
on the derotated observation - after discarding the pilot 
symbols - to produce final decisions on the source bits. 
Note that, in such a segregated system the channel estimator 
(DD-PLL) does not use any information on the structure of 
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the output sequence {y k } due to the underlying code and is 
run only once prior to the initial iteration. 

E. Numerical Results 

The SCCC system is simulated in this section. It 
includes an outer 4 -state, rate 1/2 RSC connected through a 
length N = 16384 symbol pseudo-random interleaver to an 
inner 4 -state, rate 2/3 RSC. The corresponding generator 
matrices are given by 



G outer (D) = 



l 1 + Z) 2 
1 + Z) + Z) 2 



G inner (D) = 



IOt^ 



01- 



l+D+D 2 
\+D 



l+D+D 2 . 

The output symbols are mapped to an 8PSK constellation 
with Gray encoding, resulting in an overall code rate £=1/2 

x 2/3 x log 2 8 = l. The phase process is generated as a random 
walk 

is 0*=0*-i+A (28) 

where <f> k is a Gaussian increment of zero mean and 

variance cr 2 . Only APP-type SISOs are considered here since 

the SNR loss of 0.5 dB to 0.7 dB experienced by the MSM-type 
techniques is crucial in this application. In addition, the 
20 A-SISO trellis size is the one implied by the corresponding 
FSM, i.e., no super-state-based A-SISOs are examined. The 
receivers including the inner A-SISOs are labeled as A-SISO- 
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SING/MULT -SB/NB, corresponding to single or multiple DD-PLLs 
and sub-optimal binding of equation (26c) , or no binding 
respectively. Among the A-SODEM-based receivers two special 
cases are considered: (i) the single-state A-SODEM described 
in Appendix C (labeled A-SODEM) , and (ii) a single-state A- 
SODEM variant with forward-only recursions and no binding 
(labeled A-SODEM-FW) . Finally, the baseline technique 
including a single external DD-PLL operating on the raw 8PSK 
symbols will be labeled EXT (i.e., external PLL) . In all 
simulations presented here, the initial and final phase 
estimates are assumed ideal. Consequently, for a fair 
comparison between the External PLL receiver and the 
proposed receiver structures, a forward PLL starting at the 
beginning of the block is used to derotate the first half of 
the observation, while a backward PLL starting at the end of 
the block is used for the second half of the observation. 
With such a scheme, the knowledge of both the initial and 
the final phase is utilized by the External PLL receiver. 
Note that interpolation between phase estimates obtained 
using the N d ~ separated pilot symbols was found to perform 

poorly under all operational scenarios presented. 

In FIG. 13 the BER is plotted versus the loop bandwidth 
B eq for the case of the true phase process being static 

(a>=0). A large value of B eq suggests the ability to track 
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larger phase dynamics. The comparison of EXT and A-SISO- 
MULT-SB curves leads to different conclusions depending on 
the bandwidth range. In the low loop-bandwidth range 

(B^^IQT 3 ) the tw ° receivers perform substantially 
identically, approaching the perfect CSI performance, thus 
the External PLL receiver suffices. For medium and high 

loop-bandwidth (B eq <l0~ 3 ) a clear advantage of the A-SISO- 

MULT-SB may be observed over the EXT receiver. In 
particular, the simulations show that with the proposed 
technique the PLL bandwidth may be increased two to three 
times . 

Regarding the comparison between pilot -aided and non- 
pilot-aided, the basic trade-off is controlled by the 
parameter N t (for fixed N d ) by increasing N tf better tracking 

is possible, while the symbol energy E s is reduced as 

reflected in equation (27) . In the one extreme, no pilots 
are introduced (N t = 0) , resulting in high probability of 

cycle slipping at moderate phase dynamics. In the other 
extreme (R t 1) , the SNR loss nullifies any performance gain 
due to the improved phase estimate. Two practical cases are 
shown in FIG. 13: non-pilot-aided and {N t , N d ) = (32 , 256) 

pilot-aided transmission. At low E b /N Q (i.e. 1 dB) the non- 
pilot -aided system is superior since pilots introduce an 




Attorney Docket No. 




PATENT 

666/060001/USC-2990 



energy loss of -101og 10 R t = 0.51 dB, reducing the effective 
E s /N 0 to 1-0.51=0.49 dB, which results in poor performance 

even in the coherent case. At medium E b /N 0 (i.e., 1.5 dB) 
the trade-off is reversed, generating a two to three-fold 
advantage of the pilot -aided system over the non-pilot -aided 
one in terms of B eg . This behavior is attributed to the 

fact that the former system is able to maintain phase lock 

for wider loop bandwidths . Finally, at large E b /N 0 values 

(i.e., 2 dB) , the superiority of the pilot-aided system is 
even more evident, giving rise to as much as five to seven 
times increase in B eq , and achieving even lower BER. The 

above comparisons raise the issue of proper selection of the 
system parameter (N t ,N d ) . The design procedure is initiated 

by setting a target BER and B eq region. A search procedure 

is then followed, in the process of which, E b /N 0 and N t are 

gradually increased until the target (BER, B eq ) pair is 

reached. Regarding the selection of N df it should be 

smaller than the average time-to-slip or else the 
performance will be dominated by cycle slips. 

FIG. 14 shows a comparison of the SCCC system with the 
industry standard rate 1/2, 128-state CC, in the more 
realistic scenario that includes phase dynamics. The CC 
output is mapped on a QPSK alphabet resulting in a rate R =1 
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(bits per channel use) code (no pilot symbols are used) . 
MLSD with the aid of a VA is performed in the coherent case, 
while two adaptive receiver structures are considered. The 
first is the Conventional Adaptive-MLSD (CA-MLSD) receiver 
including a single DD-PLL driven by delayed tentative 
decisions from the VA. The second is a PSP-based receiver 
having a VA with 128 DD-PLLs driven with zero-delay 
decisions. The SCCC receivers considered are the EXT, A- 
SODEM, A-SODEM-FW, A-SISO-MULT-SB, A-SISO-SING-SB, and A- 

SISO-SING-NB. Simulations were, run for o^=2° and B eq was 
optimized for each E b /N 0 value. Examining the CC 

performance curves, the following observations may be made. 
With perfect CSI, BER of 10" 5 is achieved at E b /N 0 =3.75 dB. 

The PSP-based receiver operates at this BER with a loss of 
0.4 dB, while the CA-MLSD receiver performs poorly resulting 
in a BER of 10" 2 at 4 dB. 

The design procedure outlined in the previous 
paragraphs was followed for the selection of N t for the SCCC 
case. Simulation trials' not shown here suggested that a 
reasonable pair is (N t ,N d ) = (16 , 256) for a target BER of 10" 5 

and the mentioned phase dynamics. Observing the SCCC curves 
in FIG. 14 it may be concluded that A-SISO-MULT-SB and A- 
SISO-SING-SB (d=0) perform identically (0.6 dB away from the 

coherent case) . This may be attributed to the fact that the 
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former corresponds to an FSM of only 4 states. Therefore, 
there is no notable gain by using four PLLs instead of one 
PLL. This is to be contrasted with the 128-state CC case, 
where a large difference between the CA-MLSD and PSP-based 
5 decoders is observed. A small degradation of 0.2 dB is 
observed when binding is dropped in the A-SISO receiver. 
The A-SODEM receiver has very similar performance with the 
A-SISO-MULT-SB and A-SISOSING SB receivers, even though it 
corresponds to no apparent memory, while the simplified 
10 version A-SODEM-FW, results in a performance loss of 1 dB 
compared to the best adaptive receivers considered. 

Regarding the performance of the proposed adaptive 
schemes, it may be noted that since the SNR loss due to the 
insertion of pilot symbols is -101og 10 J? t =0 . 27 dB, the actual 

is loss due to the unknown phase is only 0.33 dB (for the best 
adaptive scheme, i.e., A-SISO-MULT/SINGSB) . This means 
that even if the state space is increased (by using super- 
state-based ASIs) , the expected performance gain is very 
small (at most 0.33 dB) . As a result, in this particular 

20 application, the AS1 techniques built on the original state 
trellis suffice. 

The comparison of the CC and SCCC curves illustrates 
the importance of adaptive iterative detection. Under 
perfect CSI, the SCCC performs with a 2.6 dB gain over the 

25 standard CC. This gain vanishes when a PSP based MLSD 
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receiver is used to decode CC and the EXT receiver is used 
for SCCC. By utilizing the more advanced A-SISOs or A- 
SODEMs proposed here, together with pilot symbols, the 
corresponding gain is increased to 3 dB. 

PCCC WITH CARRIER PHASE TRACKING 

In a PCCC, a length N block of the original sequence x k 
is encoded by a rate R x CC, while an interleaved version of 
the input sequence is encoded by a second CC of rate R2, 
giving rise to the coded symbols and yf ) , respectively. 

The output symbols y^ and y[ 2) are then mapped to the 
symbols q k and transmitted over an AWGN channel which 

introduces phase uncertainty, modeled exactly as in the case 
of SCCCs . The mapping may be done after puncturing. The 
observation equation is written as 



where the time-varying mapping q k =f k (y k \y k ) is 
explicitly shown. Pilot symbols are inserted in the 
transmitted sequence in the same manner described in the 
previous section. 

For concreteness, we consider the case where y^=(x k c^) 
includes a systematic and a coded bit (R x = 1/2), j/£ 2) =c£ 2) is 
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a binary coded bit {R 2 = 1) and q k belongs to a QPSK signal 

constellation. This signaling format, which was the basis 
for the original turbo code, may be achieved by transmitting 
the systematic bit x k , together with the coded bits c£ 1} or 

c£ 2) after alternate puncturing. 



where QPSK(-,«) maps the bits to the two-dimensional 
QPSK signal constellation (e.g., using Gray mapping). 

A. Receivers 

Since PCCCs may be modeled as parallel concatenated 
FSMs, the iterative decoder shown in FIG. IB may be applied 
when perfect CSI is present. The adaptive receiver having 
an external DD-PLL operating on the coded symbols q k , 

followed by a non-adaptive turbo decoder, is a potential 
solution when knowledge of the phase offset is not available 
at the receiver. The A-SODEM-based receiver is shown in 
FIG. 1C. Finally, for the A-SISO-based receiver, contrary 
to the serially concatenated examples considered above, the 
PCCC has the property that the outputs of both FSMs are 
directly affected by the channel. Furthermore, the outputs 
of the constituent FSMs are coupled via the nonlinear 
mapping of equation (29) . This makes the substitution of 
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the perfect-CSI SISO by an A-SISO insufficient for 
performing adaptive iterative detection in this case. Thus, 
adaptive iterative detection for this PCCC application 
requires a method for evaluating transition metrics and 
updating phase estimates for each A-SISO. One specific 
approach to the method is presented below. 

Metric Evaluation: Metric evaluation in A-SISO may be 
performed by treating the output symbols corresponding to 
CC2 as nuisance parameters and either averaging or 
maximizing over them. Since APP soft metrics are proven to 
be superior compared to MSM ones (for the particular 
application in the perfect CSI case) , averaging over the 
output symbols of CC2 seems to be a preferable choice. For 
example, the simplified transition metric used in equation 
(26a) and in Appendix B for the forward recursion in SIS01 
is evaluated as 



where yy is an implicit function of t*. A reasonable 

choice for the probability P(y^) is to use the most recent 

soft-metrics produced by A-SIS02. This is identical to the 
operation of soft mapper (SOMAP) in the case of perfect CSI 
The only difference is that the demodulator and the SOMAP 
are now integrated with the A-SIS01, since a phase estimate 
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is required for this operation. This solution is both 
simple to implement, and compatible with the notion that 
SISO blocks exchange information only in the form of soft 
metrics. A similar procedure may be followed for the 
evaluation of the transition metrics of A-SIS02. Parameter 
Estimate Update: Several options are considered for updating 
the phase estimate in A-SISO. 

Starting from the simplest solution, the channel update 
in A-SIS01 is only performed for those time instants k, for 

which the symbol q k is only a function of (k is even) . 

The resulting updates for this punctured DD-PLL become 



A block diagram of the operations required to complete 
one step in the trellis is shown in FIG. 15. In particular, 
the transition from state s k to s k+1 is shown, for the 

forward recursion of A-SIS01, where multiple estimators (one 
per each FSM state) are assumed. For the phase estimate 

update, an estimate q k of the raw symbol q k is required for 

the DD-PLL, and is obtained by utilizing information about 

x kf c£ 1} and c£ 2) . In the case of punctured DD-PLL, only 

and c£ 1} are required (as shown in equation (32)), and are 
completely specified by the state transition t k . The 
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immediate consequence of this sort of channel update is a 
loss of the full tracking ability of the estimator (i.e., 
the effective loop bandwidth is halved) . In addition, such 
an approach is not always applicable, since the mapping f k {-) 

may always be an explicit function of the symbol c£ 2) , as in 
the case of non-punctured codes. 

In a more refined technique, the channel estimator - 
and in particular the DD-PLL (or DD-PLLs) - is updated for 
every time instant k. Referring to FIG. 15, as in the 

previous case, x k and are determined by the state 

transition of A-SIS01, while an estimate c[ 2) of. c£ 2) is 
determined by hard quantizing the most recent soft 
information of c£ 2) available either from A-SIS02 or from any 
other soft block in the adaptive receiver. 

The resulting updates for this parallel DD-PLL become 



Finally, an even more sophisticated technique may be 
derived by utilizing a mixed-mode PLL. Such a PLL operates 

in a decision directed mode in terms of the symbol 

= ( x k> c k ) ) ' while it effectively averages out the symbol 

4 2) . Hybrid schemes that use a punctured PLL initially and 



0 k ^+^{z k QPSK(x k9 c^)*e' je ^} *even 
^ O^+^z^SKix^ye'^} kodd 
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switch to a parallel decision-directed operation are also 
possible. 

In the following, the first order DD-PLL and sub- 
optimal binding term in equations (25) and (26c) will be 
used. Transition metrics are evaluated by averaging out the 
symbols corresponding to the other FSM as described in 
equation (31) . Finally, a hybrid approach for phase 
tracking is used. Specifically, A-SISOl is run with the 
punctured DD-PLL of equation (32) on the initial iteration, 
and switches to the parallel decision-directed mode of 
equation (33) in the subsequent iterations. The rational 
behind this hybrid bootstrapping procedure is that in the 
first iteration, there are no soft (or hard) decisions 

available for the symbol . The activation schedule for 
the iterative detector is shown in FIG. ID and described as 
follows: A-SISOl (with internal SOMAP and demodulator) -> 
SOBC (which effectively forwards extrinsic information form 
A-SISOl to A-SIS02) -> A-SIS02 (with corresponding internal 
SOMAP and demodulator) -> SOBC in the opposite direction -> 
A-SISOl, etc. 



B. Numerical Results 

An overall rate R=l code is considered in this section, 
constructed by concatenating two identical 4 -state Recursive 
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Systematic convolutional Codes (RSC) , and using a size N = 
16384 pseudorandom interleaver. Both the systematic and the 
encoded bits are output from the first code, while only the 
encoded bit is output from the second. The corresponding 
generator matrices are given by 



{ 1 + D 2 
l + £> + Z) 2 



G 2 (D) = - 



1 + Z) 2 



X + D + D 1 

The output symbol is formed as described in equation 
(12) . In FIG. 16, performance curves similar to those of 
FIG. 14 are presented. The conclusions are similar to the 

10 SCCC case, with the only difference being the slight 
degradation of the A-SISO-SING-SB and A-SISO-SING-NB 
techniques over the A-SISO-MULT-SB receivers. In addition, 
the A-SODEM-FW performance is very close to the performance 
of the A-SISO-based receivers, and the A SODEM technique 

15 results in slightly better performance compared to A-SISOs. 
Also, as in the case of perfect CSI, the quantitative 
performance achieved using the SCCC and PCCC systems is very 
similar. Finally, simulations for the case of static phase 
revealed comparable performance with that shown in FIG. 13. 



CONCLUSION 

Iterative detection may be viewed as the exchange of 
soft . information between "soft inverses" of each subsystem 
in the concatenated network, which combine and marginalize 
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this information. In one aspect of the present disclosure, 
the soft inverse of a system with parametric uncertainty 
present was developed. The adaptive soft inverse of an FSM 
(i.e., the A-SISO) was the particular focus. It was 
demonstrated how an A-SISO may be used to perform AID with 
numerical results given for the TCM-ISI serially 
concatenated system. It was found that qualitative 
conclusions regarding performance (e.g., the iteration gain) 
are substantially different for the case with parametric 
uncertainty. In particular, iteration gains for the time- 
varying fading channel were considerably larger than the 
perfect CSI case. 

By deriving the techniques starting from a reasonable 
definition for soft outputs, several classes of practical 
adaptive forward- backward techniques were obtained. The 
resulting set of practical techniques is intuitive 
combinations of forward/backward SISOs for perfect CSI and 
adaptive hard-decision techniques. Thus, several similar 
approaches may be suggested based on existing hard-decision 
techniques and/or SISO architectures. For example, the 
class of techniques for the linear Gaussian fading channel 
that utilize steady-state finite-memory estimators may 
readily be adapted to a forward- backward SISO using the 
framework developed (i.e., the binding term). Similarly, 
generalization of AID to other activation schedules and 
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architectures (i.e., FL SISOs, parallel message passing 
architectures, etc.) may be envisioned. 

For the detection of turbo codes (i.e., SCCCs and 
PCCCs) and phase tracking for the practical scenarios 
examined, it was shown that pilot-symbol-assisted adaptive 
iterative detection is effective for maintaining the near- 
Shannon- limit performance previously demonstrated for known 
phase systems. Incorporating the estimation process into 
the ID process gives advantages similar to those documented 
in adaptive hard decision literature, where joint estimation 
and data detection is considered for an isolated system 
(e.g., PSP-based phase tracking yields a factor of 2-3 in 
loop bandwidth extension for trellis-codes) . 

It may be possible to construct a more efficient SCCC 
using a QPSK constellation (e.g., by puncturing the outer 
and/or inner code) . In fact, for a channel utilization of 1 
bit per channel use at low E b /N 0f a QPSK constellation is 

adequate for achieving capacity. Also, potential further 
improvement for the SCCC may be achieved by the development 
of rotationally invariant possibly multidimensional -inner 
codes. The use of such codes may alleviate the detrimental 
effects of cycle slipping, potentially enabling even wider 
loop bandwidths . 

Decoding of PCCCs with phase tracking reveals that the 
concept of adaptive iterative detection is broader than the 
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concept of ASI techniques (A-SISOs or A-SODEMs) . Although 
practical receivers were proposed based on the latter 
structures, the development of a general framework for 
adaptive iterative detection on arbitrary networks based on 
the theoretical framework is an area for future research. 

While specific embodiments of the invention have been 
illustrated and described, other embodiments and variations 
are possible. 



All these are intended to be encompassed by the 



following claims. 



